The Assignment Of Grammatical Relations In Natural Language Processing
Abstract
One of the main goals of an interpreter is to map the syntactic descriptions found in a sentence onto the roles that the elements (described by the nominals) play in the situation at hand (described by the verb). For instance, we must be able to state that in 1) The cat ate the mouse, the cat is the "eater" and the mouse is the "eaten thing". Of course, if we talk only about roles and situations, we miss some significant generalizations. In 2) The boy drank the water, if we say that the boy is the "drinker" and the water is the "drunk thing", we disregard the evident similarity between the roles of "eater" and "drinker" in the two situations. The notion of deep case arises as the common ground underlying a number of apparently different roles. Several frameworks at the core of semantic representation and natural language processing are built on this notion (see [Fillmore 68], [Bruce 75] and [Somers 87]). The hard task is to devise a mapping between the surface descriptions and these deep cases. The complexity of some syntactic phenomena, such as passivization, subject and object raising, and long-distance dependencies, has led many researchers to posit an intermediate level between the linear string of words and the case system. The concept involved is that of "grammatical relation", such as "subject", "direct object" and "indirect object". It is claimed, for example, that passivization is explained universally (cross-linguistically) by saying that the "object" of an active sentence becomes the "subject" of the passive form, rather than by saying that the NP in the VP is moved to replace the NP in S (a direct mapping). The latter account implicitly assumes that the particular language under examination has a Subject-Verb-Object (SVO) structure, as is usually the case in configurational languages such as English.
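As a minimal sketch of this two-level idea, the Python fragment below first normalizes a clause at the level of grammatical relations and only then looks up deep cases. The lexicon CASE_FRAMES, the function names, and the deep-case labels AGENT and PATIENT are illustrative assumptions, not taken from the paper; BY_COMPL is a hypothetical label for the complement introduced by "by" in a passive clause.

```python
# Hypothetical case-frame lexicon. The relation names SUBJ and OBJ stand for
# "subject" and "direct object"; the deep-case labels AGENT and PATIENT are
# illustrative assumptions, not the paper's inventory.
CASE_FRAMES = {
    "eat": {"SUBJ": ("AGENT", "eater"), "OBJ": ("PATIENT", "eaten thing")},
    "drink": {"SUBJ": ("AGENT", "drinker"), "OBJ": ("PATIENT", "drunk thing")},
}


def active_relations(relations, passive=False):
    """Return the relations of the corresponding active clause.

    In a passive clause the SUBJ corresponds to the active OBJ, and the
    by-complement (if present) corresponds to the active SUBJ.
    """
    if not passive:
        return dict(relations)
    active = {}
    if "SUBJ" in relations:
        active["OBJ"] = relations["SUBJ"]
    if "BY_COMPL" in relations:
        active["SUBJ"] = relations["BY_COMPL"]
    return active


def assign_roles(verb, relations, passive=False):
    """Map each filler to its (deep case, verb-specific role) pair."""
    frame = CASE_FRAMES[verb]
    return {
        filler: frame[relation]
        for relation, filler in active_relations(relations, passive).items()
    }


# "The cat ate the mouse" and its passive counterpart.
cat_active = assign_roles("eat", {"SUBJ": "the cat", "OBJ": "the mouse"})
cat_passive = assign_roles(
    "eat", {"SUBJ": "the mouse", "BY_COMPL": "the cat"}, passive=True
)
# "The boy drank the water".
boy_drank = assign_roles("drink", {"SUBJ": "the boy", "OBJ": "the water"})

print(cat_active)
print(cat_passive == cat_active)
print(cat_active["the cat"][0] == boy_drank["the boy"][0])
```

Because the passive clause is normalized on relations rather than on word positions, nothing in the sketch depends on SVO order; the last line shows the "eater" and the "drinker" sharing one deep case.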
In the example 3a) Lo hanno visto gli amici di Piero (Him have seen the friends of Piero) 3b) E' stato visto dagli amici di Piero ((He) has been seen by Piero's friends), the passive form does not obey the law of direct mapping. The example is, however, easily accounted for by the relational theories: the passivization rule induces only changes of function, whereby the SUBJ becomes the BY-complement and the OBJ becomes the SUBJ. The importance of grammatical relations, taken as primitives for a universal grammar, is stated by a number of formalisms often collected under the label of Relational Grammar. The problem is to map the surface constituents into their correct roles. With languages such as Italian, which stands midway between configurational and freely ordered languages [Stock 89], some flexibility is required to accomplish this task. One possibility is to adopt a neutral syntactic structure, open to several alternatives in the interpretation process. The head & modifier approach seems to feature this kind of neutrality, and has been used effectively for dealing with free word order languages, such as the Slavonic languages [Sgall et al. 86] and Finnish [Jappinen et al. 86]. The dependency formalism we have adopted is presented in [Lesmo, Lombardo 91]. An example is reported in fig. 1, and concerns the sentence: 4) La ragazza che lavora al guardaroba fu persuasa da un cliente a comprare una enciclopedia (The girl who works at the wardrobe was persuaded by a customer to buy an encyclopedia). The daughter nodes that stand on the left of their head precede it in the linear order of the sentence, while daughter nodes on the right follow it. The arcs that link the nodes in the dependency tree are of three types: arcs of structural and logical dependency (D&S arcs, represented by bold arrows in the figure), arcs of only structural dependency (STR arcs, simple arrows in the figure), and arcs of only logical dependency (DEP arcs, dashed arrows in the figure).
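The dependency structure just described can be sketched as a small Python data type. This is only an illustration under stated assumptions: the class and function names are hypothetical, the toy tree covers a shortened version of sentence 4, and the arc labels assigned to its words are guesses made for the example, not a reproduction of fig. 1. The sketch also assumes that only arcs with a structural component (D&S and STR) determine word order, while DEP links carry logical information alone.

```python
from dataclasses import dataclass, field
from enum import Enum


class Arc(Enum):
    DS = "D&S"   # structural and logical dependency
    STR = "STR"  # structural dependency only
    DEP = "DEP"  # logical dependency only


@dataclass
class Node:
    word: str
    # (Arc, Node) daughters that precede / follow the head in the sentence.
    left: list = field(default_factory=list)
    right: list = field(default_factory=list)
    # DEP-only links: logical dependents with no effect on word order.
    logical: list = field(default_factory=list)


def linearize(node):
    """Left daughters, then the head, then right daughters, recursively."""
    words = []
    for _arc, child in node.left:
        words.extend(linearize(child))
    words.append(node.word)
    for _arc, child in node.right:
        words.extend(linearize(child))
    return words


def logical_dependents(node):
    """Words logically dependent on node: D&S daughters plus DEP links."""
    ds = [child.word for arc, child in node.left + node.right if arc is Arc.DS]
    return ds + [child.word for child in node.logical]


# Toy tree for "la ragazza fu persuasa a comprare una enciclopedia".
# Arc labels here are illustrative assumptions.
la = Node("la")
ragazza = Node("ragazza", left=[(Arc.STR, la)])
fu = Node("fu")
una = Node("una")
enciclopedia = Node("enciclopedia", left=[(Arc.STR, una)])
a = Node("a")
comprare = Node(
    "comprare",
    left=[(Arc.STR, a)],
    right=[(Arc.DS, enciclopedia)],
    logical=[ragazza],  # assumed DEP link: the girl is the understood buyer
)
persuasa = Node(
    "persuasa",
    left=[(Arc.DS, ragazza), (Arc.STR, fu)],
    right=[(Arc.DS, comprare)],
)

print(" ".join(linearize(persuasa)))
print(logical_dependents(comprare))
```

Keeping DEP links in a separate list means a word can be a logical dependent of one head while being positioned in the sentence by another.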
D&S arcs link two words that stand in a "both structural and logical" relation. STR and DEP arcs split these two functions of an arc: an STR arc individuates a purely superficial
Similar resources
Using Semantic Relations with World Knowledge for Question Answering
Two research directions are to be explored in realizing our group’s TREC QA system in 2006. The first is to investigate the possibilities of applying a linguistically sophisticated grammatical framework to a real-world natural language processing task such as question answering. The other is to exploit the possible world’s entities and relations as described in online encyclopedia i...
Learning Transformation Rules to Find Grammatical Relations
Appears in Computational Natural Language Learning (CoNLL-99), pages 43-52. A workshop at the 9th Conf. of the European Chapter of the Assoc. for Computational Linguistics (EACL-99). Bergen, Norway, June, 1999. cs.CL/9906015 Grammatical relationships are an important level of natural language processing. We present a trainable approach to find these relationships through transformation sequence...
Using decision trees to select the grammatical relation of a noun phrase
Abstract: We present a machine-learning approach to modeling the distribution of noun phrases (NPs) within clauses with respect to a fine-grained taxonomy of grammatical relations. We demonstrate that a cluster of superficial linguistic features can function as a proxy for more abstract discourse features that are not observable using state-of-the-art natural language processing. The models co...
Using Existing Systems to Supplement Small Amounts of Annotated Grammatical Relations Training Data
Grammatical relationships (GRs) form an important level of natural language processing, but different sets of GRs are useful for different purposes. Therefore, one may often only have time to obtain a small training corpus with the desired GR annotations. To boost the performance from using such a small training corpus on a transformation rule learner, we use existing systems that find related type...
Recognition of Persian Texts Using an n-gram Language Model and Grammatical Refinement
Abstract: Text recognition has been one of the growing research topics in recent years. Much of this research has focused on the recognition of letters and sub-words as a basis for identifying larger text structures such as words, phrases and sentences. This thesis presents a new method in which the recognized sub-words are combined in order to provide meaningful words and sentences in Farsi tex...
Behavioral Model Generation from Use Cases Based on Ontology Mapping and GRASP Patterns
This paper contributes a new approach for developing UML software designs from Natural Language (NL), making use of a meta-domain oriented ontology, well established software design principles and Natural Language Processing (NLP) tools. In the approach described here, banks of grammatical rules are used to assign event flows from essential use cases. A domain specific ontology is also construc...